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GENE EXPRESSION PROFILES 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims priority to U.S. Provisional Patent Applications, Serial 
5 Number 60/156,050 filed September 24, 1999 and Serial Number 60/209,026 filed June 2, 
2000, which are hereby incorporated by reference. 

FIELD OF THE INVENTION 

This invention is related to the fields of gene expression patterns, of identifying 
10 agents with antibiotic function, and of identifying genes responsive to such agents. 

BACKGROUND OF THE INVENTION 

Treatment of infections with antibiotics has led to the development by many 
pathogens of clinical resistance to such antibiotics. In the case of fungal pathogens, 

15 opportunistic fungal infections have become a life-threatening problem for individuals 
with compromised immune systems, and azoles represent a significant portion of drugs 
used to treat systemic infection by fungal commensal organisms. For example, 
fluconazole has emerged as the primary therapy for the treatment of oropharyngeal 
candidiasis in HIV-infected patients (Revankar etaU 1996, J. Infect Dis. 174:821-827; 

20 Rex et al. 9 1995, Antimicrob. Agents Chemother. 39: 1-8). Azoles such as fluconazole 
inhibit Ianosterol 14a-demethylase (Ergl lp), an enzyme in the ergosterol biosynthetic 
pathway in yeast (Turi & Loper, 1992, J. Biol. Chem. 267:2046-2056). The nitrogens in 
the azole ring form a complex with the heme iron component of the cytochrome group 
resulting in the inhibition of the enzyme (Yoshida & Aoyama, 1987, Biochem. Pharmacol. 



06/24/2004, EAST Version: 1.4.1 



WO 01/21839 



2 



PCIYUSOO/26055 



36:229-235). 

Widespread treatment with azoles has led to clinical resistance of many Candida 
spp. and other fungal pathogens (Denning et aL, 1997, Antimicrob. Agents Chemother. 
41:1364-1368; Joseph-Home et a/., 1995, Antimicrob. Agents Chemother. 39:1526-1529; 
5 Moran et aL, 1 998, Antimicrob. Agents Chemother. 42: 1 8 1 9-1 830; Vanden Bossche et aU 
1992, Antimicrob. Agents Chemother. 36:2602-2610; Venkateswarlu et aL, 1996, 
Antimicrob. Agents Chemother. 40:2443-2446). Isolation of these resistant strains has led 
to the intensive study of the molecular mechanisms by which the organism can 
compensate for growth in the presence of these drugs. Clinical isolates of azole-resistant 

10 C albicans have been shown to contain point mutations in the gene encoding Ergl Ip 
(Lamb etal y 1997, J. Biol. Chem. 272:5682-5688; White, 1997, Antimicrob. Agents 
Chemother. 41 : 1488-1494) or an increased expression of this gene (White, 1997, 
Antimicrob. Agents Chemother. 41:1482-1487) leading to decreased susceptibility to 
azole drugs. In addition to altering the target gene, C. albicans has been shown to increase 

15 the efflux of azoles by upregulation of multidrug resistant pumps belonging to the ABC 
transporter and major facilitator families in response to azole exposure, resulting in lower 
susceptibility of the organism (Sanglard et aL> 1995, Antimicrob. Agents Chemother. 
9:2378-2386; White, 1997, Antimicrob. Agents Chemother. 41:1482-1487.). 

Once the genome of a pathogen has been partially or fully sequenced, genechip 

20 microarrays may be constructed containing open reading frames (ORFs) spanning the 
entire genome. For example, following the complete sequence determination of the 
Saccharomyces cerevisiae genome, DN A microarrays have emerged as a powerful tool for 
examining the simultaneous expression pattern of more than 6000 yeast genes (DeRisi et 
a/., 1997, Science. 278:680-686; Gray etal, 1998, Science. 1281:523-528; Jelinsky & 

25 Samson, 1999, Proc. Natl. Acad. Sci. USA. 96:1486-1491; Marton et al. y 1998, Nature 
Medicine 4:1293-1301; Wodickae/ a/., 1997, Nature Biotech. 15:1359-1367). However, 
analysis of the data resulting from the use of such microarrays has heretofore been plagued 
by the need for computation of complex algorithms which limits the speed at which one of 
skill in the art may use such data to identify useful information such as drug candidates 

30 and target genes. 
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For example, Tavazoie et al (1998, Nature Genetics 22:281-285) used the iterative 
A-means algorithm to identify transcriptional regulatory sub-networks in yeast, from 
hybridization data generated from 15 time points, spanning two cell cycles of growth. 
This entailed a highly complex and time-consuming analysis of the hybridization 
5 intensities resulting from microarray screening. 

Alternatively, in Gray et al. (supra), microarray screening was used to identify 
transcripts in yeast whose levels were altered more than two-fold when treated with 
kinase-inhibitory purine derivatives, without subjecting the data to any further analyses. 

In Wodicka et al (supra), genome-wide expression levels were compared for yeast 
10 cells grown in rich and minimal media. RNA level or abundance was determined by a 

quantitative analysis of the hybridization intensities on hybridized microchips according to 
the method of Lockhart et al. (1 996, Nature Biotech. 1 4: 1 675- 1 680). Yet, this form of 
analysis does not allow one to easily distinguish treatment-specific responsive genes. 

Marton, et al. (supra) examined the genome-wide effect of gene deletions and 
1 5 chemical inhibitors of the calcineurin-signaling pathway on gene expression in yeast 
Comparisons between two treatments, or between a treatment and control, were made by 
plotting the log of the expression ratio (control culture/treated culture) for each ORF for 
one treatment or mutation, against the log of the expression ratio for each ORF for another 
treatment or mutation, or against the log of the mean intensity of hybridization for each 
20 ORF. 

The art is in need of a simplified method of analyzing microarray data, in order to 
speed the identification of drug candidates, target genes, and information related to modes 
of action of such drugs and steps in biosynthetic pathways. 

25 SUMMARY OF THE INVENTION 

This invention provides a simplified method of analysis of data generated by 
microarrays permitting those of skill in the art to generate expression profiles of agent- 
treated pathogens. The methods of the invention allow the rapid screening of candidate 
drug compounds, identification of drug candidates predicted to be effective against 
30 particular pathogens, and identification of genes involved in the pathway affected by such 
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drug compounds. 

For example, the present invention is directed to a method of preparing an 
expression profile of genes expressed by a cell treated with an agent, comprising the step 
of plotting the log(ratio) of normalized filtered ADI values resulting from a hybridization 

5 assay performed on a microarray with mRNA extracted from the cell against a sum of the 
normalized filtered ADI values. The method may be carried out with an antibiotic agent 
or an agent which is not an antibiotic agent. Expression profiles prepared by this method 
are also part of the invention. 

The invention is also directed to a method of identifying an agent capable of 

10 altering expression of a gene in a cell, comprising the step of identifying a treatment- 
affected gene from a prepared expression profile, wherein the treatment-affected gene's 
datapoint on the expression profile exceeds one standard deviation from a distribution, as 
well as agents identified thereby. Preferably the datapoint exceeds two standard 
deviations from the distribution. 

15 The invention is also directed to a method of identifying a first agent capable of 

interfering with a biosynthetic pathway, comprising the steps of comparing a plurality of 
first treatment-affected genes identified from an expression profile of a first cell treated 
with the first agent, with a plurality of second treatment-affected genes identified from an 
expression profile of a second cell treated with a second agent known to interfere with the 

20 biosynthetic pathway; and determining whether the number of the first treatment-affected 
genes which are identical to the second treatment-affected genes exceeds a percentage of 
about 20% of the total number of second treatment-affected genes, as well as agents 
identified thereby. 

The invention is also directed to a method of identifying a gene as part of a 
25 biosynthetic pathway comprising the step of identifying a treatment-affected gene 

common to a percentage of expression profiles prepared from cells treated with agents 
known to affect the biosynthetic pathway, as well as genes, in isolated form, identified 
thereby. 

These and other aspects of the invention are described in greater detail below. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the growth curves of £ cerevisiae determined using the 
Spectromax Plus plate reader (diamonds) and shake flasks (circles). 

Figure 2 shows the determination of the minimum inhibitory concentration (MIC) 
5 of ketoconazole for S. cerevisiae strain BY4743. Cultures were grown for 24 hours and 
ODsoo measured in the Spectromax Plus plate reader. 

Figure 3 depicts the relationship between the treatments used and ergosterol 
biosynthesis. Gene names were as listed in Daum et aL 9 1998, Yeast 14:1471-1510. 

Figure 4 shows the expression profiles obtained from this study. Each point 
10 represents the response of a gene to a particular treatment ORFs with Average Difference 
Intensity (ADI) values below 50 ADI units were not included. Plotted is the sum of 
experimental ADI and baseline ADI for each gene (vertical axis), and, on the horizontal 
axis, 

\og(ratio)* = log((j& IBL)!^ f[B!Bb) 

15 Figure 5 shows the genes involved in the biosynthesis of ergosterol and membrane 

components. Genes given in bold type were responsive in the study; italicized bold type 
indicates genes with decreased transcript level. Superscripts indicate the number of 
treatments to which the gene responded. The first number in the superscript indicates the 
number of the three genetic perturbations which elicited a response, and the second 

20 number indicates the number of the eight drug treatments which elicited a response. Lists 
of genes were obtained from the Yeast Protein Database (YPD) (Hodges et aL 9 1999, Nuc. 
Acids Res. 27:69-73) and Daum et al. (supra) and Denning etaL (supra). 

Figure 6 shows the genes involved in biosynthesis and utilization of heme. Bold 
type and superscripts are as described for Figure 5. Lists of genes were obtained from the 

25 YPD database (Hodges et a/., supra). 

Figure 7 shows the PCR measurements of transcripts from five genes: ERG3, 
ERG4, ERGS, ERG6, ERG1 1/16, and ERG24. Shown is the change with respect to the 
internal control, TEF1 . An increase in cycle number indicates decrease in initial template 
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concentration. 

Figure 8 presents comparisons of qualitative gene expressions patterns in response 
to different biological treatments. 18 functional pathways are presented along the x-axis. 
The treatments are presented along the y-axis. 

5 

DETAILED DESCRIPTION OF THE INVENTION 

It will be appreciated by those of skill in the art that the methods of the present 
invention are useful for analysis of many different pathogens, including but not limited to 
bacteria, yeast, and viruses, as well as eukaryotic cells. It is preferred that the pathogen's 
10 entire genome be sequenced, however, partial sequencing still permits the formulation of 
gene expression profiles and would thus be appropriate for the methods of the present 
invention. 

In general, the methods of the invention entail exposing growing cells to agents of 
interest, and analyzing the mRNA transcripts present in the cells, in response to the 

15 exposure, preferably by hybridization to DNA microarrays. Cells are exposed to agents 
and harvested preferably during mid-logarithmic growth phase. mRNA is isolated and 
typically converted to double stranded cDNA, so that amplified and labeled cRNA may be 
generated. The labeled cRNA is then incubated with a DNA microarray to allow for 
hybridization of complementary sequences. After unbound cRNA is removed, the 

20 hybridized material is detected. Detection of hybridized cRNA may be accomplished by 
any known method, preferably by binding of streptavidin-phychoerythrin to the biotin- 
labeled cRNA, and scanning for the phycboerythrin signal. Alternatively, the cRNA itself 
is labeled with fluorescently-tagged nucleotides, during amplification, for direct detection. 
The signals are manipulated by software (in the case of using an Affymetrix microarray, 

25 by the Affymetrix Genechip algorithm (using Affymetrix software)) to yield hybridization 
intensity values (average difference intensity or ADI units) for each gene on the array. 

Affymetrix microarrays consist of oligonucleotides synthesized in situ. Each ORF 
has a number of corresponding oligomers on the microarray, termed a probe set A probe 
set consists of a series of probe pairs: oligomers designed to match the ORF sequence 

30 ("perfect match" or PM), and, for each, a corresponding mismatch (MM) oligomer 
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designed to serve as a hybridization control. The Affymetrix Genechip algorithm 
computes the hybridization signal, termed Average Difference Intensity (ADI), for the 
probe pairs whose difference lies within three standard deviations of the differences of 
other probe pairs from the same probe set. This exclusion of outliers is done to minimize 
5 false signals due to bright specks or non-responding probe pairs. The ADI value for each 
ORF is the sum of (PM-MM) differences for the probe pairs used divided by the number 
of probe pairs used in the calculation. 

To obtain ADI values above the chip background, ADI values from the 
experimental data sets less than or equal to 50 are eliminated (filter A). A second filter is 

10 then applied requiring baseline ADI to be greater than 50 (filter B). 

Prior to analysis, ADI values are normalized, i.e., edited to replace values of 1 or 
below with a value of 1 . These include negative values that arise from ORFs in which the 
"Mismatch" signal is greater than the "Perfect match" signal, and reflect non-expressed 
ORFs. The actual value of these negative ADI differences is not pertinent to the analysis 

15 of expressed genes, and replacing these values with 1 eliminates subsequent editing 
required to remove errors generated when attempting to divide by 0 or to calculate the 
logarithm of a negative number. Following this normalization procedure, the 
Experimental (E) ADI values are compared on a gene-by-gene basis to the ADI value of 
the ORF under unperturbed conditions (termed the Baseline (BL) value, the arithmetic 

20 average of ADI values for that ORF derived from six replicate data sets from mid-log 

phase growth). Two values are computed for each ORF: (1 ) the sum of edited ADI values 
for that ORF from Experimental and Baseline data sets, and (2) the log of the ratio 
(log(ratio)) of Experimental to Baseline ADI values for that ORF. 

To compensate for differences between chips in a data set, the E/BL ratio for each 

25 ORF is divided by the geometric mean of the E/BL ratios for that chip. Dividing by the 
geometric mean has the effect of centering the log(ratio) distribution on zero and 
eliminating any small variance between chips within a data set prior to genome-wide 
analysis. 

Thus the values (composed of Experimental (Ej) ADI and BaseLine (BL§) ADI 
30 values) used for analysis are, for each gene; of the n genes on the chip: 
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sum = Ei + BL 

\o%(ratio)i = Iog((£7 Bb)/tffl&/ BL*) 

Comparisons between data sets are preferably performed visually using Spotfire 
Pro 3.0 (Spotfire Corp., Cambridge MA) and numerically using Microsoft Excel. Data are 
5 imported into Spotfire, binned by standard deviation, and displayed graphically. The 

graphical representation of this data, an example of which is shown in Figure 4. represents 
a novel method of formulating an expression profile, and is useful for all the methods of 
the invention. 

In one embodiment of the invention, the log(ratio) (as defined above) for each 

10 ORF is plotted against the sum (as defined above) for each ORF, yielding an expression 
profile such as that shown in Figure 4. As a control, in a preferred embodiment, such 
profiles may be prepared from untreated cells as well as from cells treated with potential 
active agents, and the profiles compared. 

In a preferred embodiment, the datapoints on the expression profile are considered 

15 in light of their standard deviations from the distribution, as is also shown in Figure 4. 
Preferably those ORFs showing greater than one standard deviation from the distribution 
in either the positive or negative direction are considered indicative of up- or down- 
regulated genes, respectively, and are thus identified as treatment-affected genes. More 
preferably, those ORFs showing greater than two standard deviations from the distribution 

20 in either the positive or negative direction are considered indicative of up- or down- 
regulated genes, respectively (i.e., treatment-affected genes). 

In another embodiment, the expression profiles are used to evaluate potential 
active agents which may or may not belong to a class of agents known to interfere with a 
particular biosynthetic pathway. Cells are treated with agents known to interfere with the 

25 biosynthetic pathway, and expression profiles are created as in previously discussed 
embodiments. The expression profiles are compared in order to identify those genes 
which are up- or down-regulated by the active agents. In a preferred embodiment, those 
genes whose expression is most affected by the active agent (e.g., show greater standard 
deviations from the distribution) are identified as treatment-affected genes. Preferably, the 
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most affected genes show standard deviations in excess of one standard deviation, more 
preferably in excess of two standard deviations. 

Expression profiles from cells treated with agents whose effect is unknown may 
then be prepared and compared to the expression profiles for active agents known to 
5 interfere with biosynthetic pathways, in order to determine whether the agent whose 
function is unknown would be predicted to have the same or similar effects as the known 
agents. In a preferred embodiment, an unknown agent whose expression profile shows up- 
and down-regulation of some of the treatment-affected genes allows one of skill in the art 
to conclude that the unknown agent is likely to be effective at interfering with the 

10 biosynthetic pathway in question. Preferably, the agent whose function is unknown can be 
said to be likely to have similar effects as the known-active agents if the treatment- 
affected genes identified in the expression profile of the unknown agent are some of the 
same treatment-affected genes identified in the expression profiles of the known agents. 
Preferably, if about 20% of the treatment-affected genes identified in the expression 

15 profile of the known active agents also are identified as treatment-affected genes in the 
expression profile of the unknown agent, it can be concluded that the unknown agent is 
likely to exert substantially similar effects on the pathway involved as the known agents. 
More preferably, the percentage of treatment-affected genes common to the expression 
profiles of both the known and unknown agents allowing one of skill in the art to conclude 

20 a substantially similar effect on the pathway is about 30%, more preferably about 40%, 
more preferably about 50%, more preferably about 60%, more preferably about 70%, 
more preferably about 80%, more preferably about 90%, and most preferably about 100%. 

In another embodiment of the invention, the expression profiles prepared as 
described above are used to identify the genes affected by a panel or class of agents. 

25 Several members of a class of agents with known activity against a particular pathogen are 
used to treat target cells, and expression profiles are prepared as described above. The 
treatment-affected genes are then identified for each expression profile. Preferably, the 
treatment-affected genes are identified as having standard deviations in excess of one 
standard deviation, more preferably in excess of two standard deviations. 

30 In this document agents are considered any composition of matter, preferably small 
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molecules, that can do the following: they might inhibit the growth of a target cell, they 
might prove to be biologically active either enhancing or ameliorating a particular disease 
state. Agents can also produce nonspecific effects, such as producing general 
physiological stress. As used here, for example, an antibiotic is an agent that inhibits a 

5 microorganism. The microorganism may be a eukaryotic or prokaryotic cell, it can 

include yeast and fungus, bacteria, protozoa. Here when an antifungal agent is referred to 
it would be an antibiotic agent that inhibits the growth of yeast or fungi. 

In a preferred embodiment, the treatment-affected genes identified for the 
expression profiles of each active agent are compared, and treatment-affected genes 

10 identified as common to at least about 25% of the class of active agents tested are 

identified as class-specific treatment-affected genes. Preferably, treatment-affected genes 
identified as common to at least about 40% of the class of active agents tested are 
identified as class-specific treatment-affected genes. Preferably, treatment-affected genes 
identified as common to at least about 55% of the class of active agents tested are 

15 identified as class-specific treatment-affected genes. Preferably, treatment-affected genes 
identified as common to at least about 70% of the class of active agents tested are 
identified as class-specific treatment-affected genes. Preferably, treatment-affected genes 
identified as common to at least about 85% of the class of active agents tested are 
identified as class-specific treatment-affected genes. Preferably, treatment-affected genes 

20 identified as common to at least about 1 00% of the class of active agents tested are 
identified as class-specific treatment-affected genes. 

In a preferred embodiment, the class-specific treatment-affected genes are 
considered regarding their role in a biosynthetic pathway. Where a gene or several genes 
are involved in the same pathway, it may be concluded that it is likely that the class of 

25 active agents exert their effects on that particular pathway. This method thus also aids in 
identification of the mode of action of a class of agents. 

In another embodiment, the invention is directed to the expression profiles of yeast 
treated with the azole and imidazole compounds described above. 

Other embodiments of the invention will be readily understood by those of skill in 

30 the art 
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The present invention is more fully described using the particular example of yeast, 
although the methods used for other organisms would be essentially the same. 

Ergosterol is an essential component of yeast plasma membranes which affects 
membrane fluidity, permeability, and the activity of membrane bound enzymes (Cobon & 

5 Haslam, 1973, Biochem. Biophys. Res. Comm. 52:320-326; Daum et aL, supra; Parks, 
1987, CRC Crit. Rev. Microbiol. 6:301-341). InS. cerevisiae, ergosterol is also a major 
component of secretory vesicles and has an important role in mitochondrial respiration 
Paum et al, supra; Parks, supra; Zinser et al y 1 993, J. Bact. 1 75:2853-2858). Ergosterol 
has also been predicted to play a role in oxygen sensing (Smith & Parks, 1997, Biochem. 

10 Biophys Acta 1345:71-76), defined by the well-characterized sparking function of this 
sterol (Lorenz et al. y 1989, J. Bact. 171:6169-6173; Rodriguez et a/., 1985, Biochem. 
Biophys. Acta 837:336-343). Genes in the ergosterol pathway demonstrate 
transcriptional regulation in response to a mutation in other ERG genes and resulting sterol 
limitation (Arthington-Skaggs et aL y 1996, FEBS Lett. 392:161-165; Kennedy et al. y 1999, 

15 Biochim. Biophys. Acta. 1445:1 10-122). Other genes such as CYB5, COX3, RPL27 
among others have been shown to contribute to altered sensitivity to azoles when 
overexpressed (Launhardt et d/., 1 998, Yeast, 14:935-942; Truan et al, 1 994, Gene. 
142:123-127). Thus, the analysis of the genome-wide response to ergosterol perturbation 
may reveal novel mechanisms for resistance, or new sites for chemical intervention, in 

20 addition to increasing understanding of the cellular response to perturbation of ergosterol 
biosynthesis. 

The methods of this invention were used to reveal a convergent pattern of gene 
expression between drug-treated cells and cells with genetic alterations in the same 
targeted pathway. From the data generated, a set of genes can be identified that is 

25 predictive of an "ergosterol response." Other responsive genes have been identified that 
offer insight into the relation of ergosterol biosynthesis to important physiological changes 
in the cell. Additionally, informative transcriptional changes were observed that can be 
used to identify potential targets for chemical intervention. The invention also provides 
methods by which genome-wide transcriptional changes in yeast can be analyzed 

30 following exposure to a drug with an uncharacterized mode of action. 
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Inhibitors of ergosterol biosynthesis such as the azoles are a major class of 
antifungal therapy. Disruption of ergosterol biosynthetic genes mimics phenotypic 
changes observed during exposure to these agents, thus lending evidence to the agents' 
mode of action. Using Afiymetrix microarrays, gene expression patterns were obtained 

5 for £ cerevisiae strains bearing homozygous disruptions of ERG2, ERGS or ERG6 and 
their parent strain BY4743. In an effort to demonstrate a correlation between a genetic 
mutation and a biological drug response, the transcript profile was determined following 
exposure of BY4743 cells to sub-MIC levels of a number of inhibitors of ergosterol 
biosynthesis. Comparison of these transcript profiles lends insight into known and 

10 previously unknown modulations of gene expression. Genes involved with general 
cellular response to these agents can be distinguished from genes modulated in cells 
carrying a single genetic aberration in this pathway. A set of marker genes was 
ascertained from these transcript profiles, and primer/probe sets designed to verify these 
transcription patterns using quantitative RT-PCR. This method confirms the changes in 

1 5 transcript levels observed using hybridization to microarrays. 

Enzymes in the yeast ergosterol biosynthetic pathway are the targets of a number 
of antifungal agents including azoles, allylamines, and morpholines. In order to 
understand the response of £ cerevisiae cells to perturbations in the ergosterol pathway, 
genome-wide transcript profiles following exposure to a number of antifungal agents 

20 targeting ergosterol biosynthesis (clotrimazole, fluconazole, itraconazole, ketoconazole, 
voriconazole, terbinafine and amorolfine) were obtained. These profiles were compared to 
the transcript profiles of strains containing deletions of late-stage ergosterol genes: ERG2, 
ERGS, or ERG6. This comparison allowed the response to chronic genetic perturbations 
to be compared to the response resulting from acute exposure to chemical agents. Data 

25 sets from these experiments were compared on a gene-by-gene basis and filtered to derive 
a subset of genes exhibiting a transcriptional response to ergosterol perturbation (i.e., 
class-specific, treatment-affected genes). Among the 234 genes identified, the majority of 
genes from the ergosterol pathway were found to be responsive. The levels of transcripts 
from ERG3, ERG4, ERGS, ERG6, ERG1 1 and ERG24 were measured over time by 

30 quantitative PCR. Comparison of these measurements to microarray results showed 
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similar trends in expression levels. In addition to ergosterol biosynthetic genes, 36 
mitochondrial genes and a number of other genes with roles related to ergosterol function 
were responsive, as were a number of genes responsive to oxidative stress. 
Transcriptional changes related to heme biosynthesis were observed in cells treated with 

5 chemical agents, suggesting an additional effect of exposure to these compounds. The 
expression profile in response to a novel imidazole, PNU-144248E, was also determined. 
The concordance of responsive genes suggests that this compound has the same mode of 
action as other imidazoles. Thus, genome-wide transcript profiles can be used to predict 
mode of action as well as to characterize expression changes in response to perturbing a 

1 0 metabolic pathway. 

The invention is further illustrated by way of the following examples, which are 
intended to elaborate several embodiments of the invention. These examples are not 
intended, nor are they to be construed, as limiting the scope of the invention. It will be 
clear that the invention may be practiced otherwise than as particularly described herein. 

15 Numerous modifications and variations of the present invention are possible in view of the 
teachings herein and, therefore, are within the scope of the invention. 

EXAMPLES 

Example 1: Materials and Methods for Practicing the Invention with Yeast 
20 Reagents 

Clotrimazole, biotin-1 1-CTP and biotin-16-UTP were obtained from Sigma 
Chemical Company; ketoconazple was from Biomol Research Labs, Inc.; itraconazole was 
from Accurate Chemical and Scientific Corp., Westbury NY; and terbinafine was from 
TCI, Tokyo. Amorolfine, fluconazole, voriconazole and PNU-144248E were synthesized 
25 at Pharmacia & Upjohn. 
Yeast Strains 

£ cerevisiae BY4743 (a/a his3A/his3A, leu2A/leu2A, +/lys2A, metl5A/+, 
ura3A/ura3A), strain 30568 (a/a, erg6A/erg6A, his3A/his3A, leu2A/leu2A, +/lys2A, 
metl5A/+, ura3A/ura3A), strain 30590 (a/a, erg5A/erg5A, his3A/his3A, leu2A/leu2A, 
30 +/lys2A, metl 5A/+, ura3 A/ura3 A), strain 30788 (a/a, erg2A/erg2A, his3A/his3 A, 
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leu2A/leu2A, +/lys2A, metl5A/+, ura3A/ura3A). All strains were obtained from Research 
Genetics (http://www.resgen.com) . 
MIC Determinations 

A 10 ml culture of YPD medium (1% Difco yeast extract, 2% Difco peptone, 2% 
5 glucose) was inoculated from a colony and grown overnight at 30° C to saturation 1 to 2 
x 10 8 cells/ml). The culture was then diluted to an OD 6 oo = 0.1 and 50 ml used to 
inoculate a 96 well U-bottom culture plate (Costar, Coming, NY) containing 50 ml of 2- 
fold serially diluted test compounds starting at 100 mM. The culture was allowed to grow- 
in the presence of drug for 24 hrs at 30°C in a SpectraMax Plus (Molecular Dynamics 

10 Corp., Sunny Vale, CA) and the minimum inhibitory concentration (MIC) was determined 
. as the first well with no growth. The growth curve obtained for each drug was used to 
identify a sub-MIC concentration needed to slow growth without affecting the endpoint 
cell concentration. The endpoint cell concentration was required to be sufficiently high 
enough to yield an adequate amount of poly A(+) RNA for expression analysis. 

15 Cell Culture and Drug Exposure 

A 40 ml culture of YPD medium was inoculated from a colony and grown 
overnight at 30° C and 140 rpm to saturation (~ 1 to 2 x 10* cells/ml). The culture was 
diluted ten-fold to allow growth through one doubling (90 min). For drug-treated cells, 
drug was added to each culture at a concentration equal to 0.5 X MIC (concentrations used 

20 were: 0.6 jiM clotrimazole, 25 |xM fluconazole, 0.6 \xM itraconazole, 4 |iM ketoconazole, 
0.6 \xM PNU-144248E, 0.19 jiM voriconazole, 0.1 jiM amorolfine, and 50 jiM 
terbinafine), and the culture incubated for 90 minutes. Strains containing mutations were 
harvested at late-logarithmic phase (- 5 x 10 7 cells/ml). Cells were pelleted at 1500 rpm 
20° C for 5 min in an IEC MP4R centrifuge. The pellets were washed with 1 ml water at 

25 22°C, previously treated with diethylpyrocarbonate (DEPC) to inactivate RNases. Cell 
pellets were placed on ice and RNA was extracted immediately to minimize change in 
expression profile. ... 
RNA Preparation and Hybridization to Afiymetrix Microarrays 

RNA preparation and hybridization to Afiymetrix DNA microarrays were 

30 performed as described by Wodicka et aL, supra. Briefly, cells were harvested, washed 
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with water and lysed quickly and RNA extracted using hot acidic phenol (Lin el a/., 1996. 
pp.43-50, in P.A. Kreig (ed.)> A laboratory guide to RNA isolation and synthesis. Wiley- 
Liss, Inc., New York). Poly A(+) RNA isolated using Oligotex mRNA kit from Qiagen 
Inc. (Valencia, CA). Double-stranded cDNA synthesized using the Superscript Choice 

5 system (GibcoBRL, Gaithersburg, MD). Labeled cRNA was generated with biotin-1 1- 
CTP and biotin-16-UTP using the Megascript System (Ambion Inc., Austin TX) for 
hybridization to the microarrays. Eleven mg of the resulting cRNA was used to probe the 
four arrays comprising the yeast genome, following the method recommended by 
Affymetrix Inc. (Santa Clara, CA) and described by Wodicka er a/., supra. 

10 Experimental Design and Methodology 

Because of the scope of each data set obtained from a microarray hybridization, 
exposures were limited to a single concentration at a single time point. The concentration 
and duration of exposure were chosen to elicit a biological effect while attempting to 
minimize secondary effects due to prolonged exposure. Since S. cerevisiae will double 

15 approximately every 90 minutes in rich media (as shown in Figure 1 ), one doubling time 
was selected for the duration of exposure. Sub-MIC concentrations sufficient to slow but 
not stop growth were used, as described above and as illustrated in Figure 2. Cells were 
exposed to agents and harvested during mid-logarithmic growth phase when the cell is 
most rapidly dividing and expression patterns seen in slow-growing or saturated cultures 

20 are not observed. The S. cerevisiae strain used in this study was BY4743, the parent strain 
for homozygous deletion strains used, and isogenic to that used for sequence 
determination and microarray synthesis. 

RNA was prepared following exposure of the parent strain BY4743 to five 
previously characterized azoles (clotrimazole, fluconazole, itraconazole, ketoconazole, 

25 voriconazole), a novel imidazole, PNU-144248E, (as shown in Figure 3), an allylamine 
(terbinafine) and a morpholine (amorolfine) as described above. In addition, RNA was 
prepared from three strains each bearing a homozygous deletion of ERG2, ERGS, or 
ERG6 encoding C-8 sterol isomerase, sterol C-22 desaturase, and sterol C-26- 
methyltransferase respectively. RNA preparations were also made from two untreated 

30 control cultures. 
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Figure 3 also shows the genes involved in ergosterol biosynthesis from farnesyl 
pyrophosphate in S. cerevisiae. Genes in the deletion strains are given in bold font. The 
site of action of the azole, allylamine, and morphoiine antifungal agents are noted with 
arrows, and the structures of the compounds used in this study are given. 

5 

Example 2: Identification of Gene Expression Patterns 

Each Afiymetrix yeast genome set represents 6593 ORFs, including 172 control 
genes. Thus the eleven data sets from the treatments comprising this study represent 
72523 data points. Hybridization intensity values are expressed in ADI units, as described 

10 above. Following normalization to account for variations in chip intensity (described 
above), filter n A" was applied requiring the experimental ADI value to be above 50 ADI 
units, thereby limiting the data to 29428 points. Applying a second filter, "B", requiring 
the baseline ADI value to be greater than 50 ADI units, further limits the data to 20697 
points. The ADI values used in the filters were chosen after examination of the 

15 background ADI value calculated by the GeneChip software and the ADI values for 

selected unexpressed genes (e.g. haploid-specific genes in diploid cells); both approaches 
gave a value of up to -25 ADI units as that corresponding to no signal. 

Experimental ADI values were then expressed on a gene by gene basis as the sum 
of experimental ADI with baseline ADI, and as the logio of the ratio of the experimental 

20 ADI to the baseline ADL To account for inter-chip variation in signal intensity, log(ratio) 
distributions obtained for each chip were centered by subtracting the geometric mean from 
the individual values as described above. This has the effect of multiplying ADI values by 
at most a factor of three in the experiments described here, and allows comparison of 
hybridization values obtained from different chips. These values are plotted in Figure 4. 

25 Points are colored by position in the distribution of log ratios, using white for points 

within one standard deviation of the distribution, black for those points between one and 
two standard deviations of the distribution, and gray for those beyond two standard 
deviations. 

Genes with ADI ratios beyond one standard deviation were considered to be 
30 responsive to the treatment (i.e., treatment-affected genes). By this criterion, 1 1 54 genes 
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responded by increased mRNA level in at least one treatment; 1358 genes responded by 
decreased levels relative to baseline. It is likely that expression changes specific for the 
particular treatment as well as other perturbations in transcript level due to uncontrolled 
changes in microenvironment, unstable transcripts, etc. are detected in these profiles. To 

5 distinguish those genes indicative of a response of the cell to perturbation in ergosterol 
biosynthesis, genes responding in at least five of the eleven experimental treatments were 
considered in the subsequent analysis. The intention of selecting by these criteria is to 
identify genes that have a convergent pattern of expression across many individual 
treatments which may therefore be indicative of a common response. 

10 156 genes showed significant increase in transcript level in five or more treatments 

and 78 showed significantly decreased transcript level in five or more treatments. These 
were annotated using the biological role assigned by the Proteome YPD database (Hodges 
et a/., supra) and grouped accordingly. The number and characteristics of the responsive 
genes in these biological role groupings are shown in Table 1. The column headed 

15 "number of genes" indicates the number of individual genes in the category responsive in 
five or more treatments; the column "number of hits" indicates the number of times a gene 
was responsive to a treatment The categories in Table 1 are ordered by number of genes. 
The category with the largest number of hits is the unannotated or "unknown" group of 
genes, which contains 53 genes. Next most abundant is the group of 36 responsive 

20 mitochondrial genes, followed by 22 responsive genes involved in biosynthesis of lipids, 
fatty acids and sterols. This group includes 9 genes in the ergosterol pathway. The other 
categories in Table 1 include 24 genes involved in translation and protein processing, 20 
stress-response genes, 16 membrane proteins, and a number of other genes involved in 
other biological pathways. The category "other related genes" refers to genes related 

25 through other experimental results to ergosterol perturbation, and are described in Tables 2 
and 3 and Figures 5 and 6. The category "other genes" includes responsive genes which 
were the sole representative of a particular biological pathway, and whose relationship to 
ergosterol perturbation could not be discerned. 
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Table 1 lists genes responding in five or more treatments. Shown are (1) the 
number of responsive genes in each functional class (biological role), (2) the number of 
treatments in which these genes responded, (3) the corresponding ratio of hits per gene, (4) 
the number of drug treatments in which these genes responded, (5) the corresponding 
5 average of drug hits per gene, (6) the number of the genetic disruptions with alterations in 
these genes (# of knockouts (KOs)), (7) the corresponding number per gene, (8) the 
number of genes which also respond to saturated culture, (9) the percentage of genes 
responding to saturated culture, (10) the percentage of genes responding to drug 
treatments, (1 1) the percentage of genes responding to genetic perturbation, and (12) the 

10 direction of the response (increased (up)or decreased (down) expression). Functional 
annotations were obtained from the YPD database (Hodges et al. y supra). 

Responsive genes are also characterized by the relative number of responses due to 
chemical vs. genetic perturbation (drug/gene and KO (knockout)/gene, respectively). This 
is indicated in the columns headed %D and %KO in Table 1 . Values ranged from highs of 

15 88-87% of the response due to chemical perturbation, in the case of membrane-associated 
proteins, to lows of 62% of the response due to response to chemical perturbation, and 
therefore 38% due to response to genetic perturbation, seen for the class of lipid / fatty 
acid / sterol pathway genes. 

The column headed "Sat", in Table 1, indicates the number of responsive genes in 

20 each category whose transcript levels changes as the cells enter saturated growth and 
undergo the diauxic shift (DeRisi et al. 9 supra-, Hodges et aL 9 supra\ G. Bammert and J. 
Fostel, unpublished). The proportion of responsive genes which are also altered in 
response to saturated growth varies from 0% of genes in several pathways to a high of 
78% and 83% of ergosterol-responsive genes related to amino acid and carbohydrate 

25 metabolism, respectively. This behavior was used to subset the unannotated genes in an 
attempt to further characterize this group of genes. Genes which are responsive to culture 
saturation have more hits per gene that non-responsive genes, although the data are 
insufficient to determine if this difference is significant. 
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Example 3: Responsive Genes in the Ergosterol Pathway 

Figure 5 shows the genes involved in the biosynthesis of ergosterol from acyl- 
CoA, highlighting some of the responsive genes identified above. Genes listed in bold 
type in Figures 5 and 6 were responsive in this study; those with italicized names showed 
5 a decrease in transcript relative to baseline, non-italicized gene names indicate an increase. 
The number of responsive treatments is indicated with superscripts, where the first number 
gives the total of the three disruptant strains in which the particular gene responds, and the 
second number gives the total of the eight drug treatments in which the gene was 
responsive. 

10 Most notably, nine genes in the ergosterol pathway respond with increased 

transcript levels to the conditions used in this study. Also responding is NCP1 (NADP- 
cytochrome P450 reductase), which is the electron donor for squalene epoxidase, 
lanosterol 14a demethylase and sterol C22 desaturase (Sutter et a/., 1989, Biochem. 
Biophys. Res. Commun 160:1257-1266) and is coordinately regulated with ergosterol 

15 genes. The ergosterol pathway had the highest proportion of responsive genes identified, 
in agreement with previous studies showing that this pathway is the target of azoles and 
responsive to modulations of ergosterol level. 

The biosynthesis of ergosterol, similar to other complex biological processes, 
involves the coordination of many factors by which the cell regulates the synthesis of an 

20 essential component. ERG1 9 is reported to contribute to regulation of flux through the 
mevalonate pathway (Berges et a/., 1997, J. BacterioL 179:4664-4670), and is increased in 
response to perturbations here. Expression of ERG3 has been shown to increase not only 
following treatment with antifungal agents (Smith et al. 9 1996, MoL Cell Biol. 16:5427- 
5432) but was also increased by mutations in ERG2, ERGS, and ERG6 (Arthington- 

25 Skaggs et a/., supra). These data are supported with observations reported here with 

ERG3 showing an increased expression following drug treatment in addition to increased 
expression in the three ERG deletion strains. Additionally, expression of NCP1 has been 
shown to increase five fold in a strain constitutively overexpressing ERG1 1 (Thorsness et 
aL 9 1989, Mol. Cell Biol. 9:5702-12), consistent with coordinate increase in transcript 

30 levels of these two genes in this study. 
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One unexpected result relates to ERG9, a branch-point in the isoprenoid pathway. 
Others have reported that this gene is subject to regulation in strains carrying mutations in 
ERG3, ERG7, and ERG24, as well as cells treated with ketoconazole (Kennedy et aL 
supra). In contrast to earlier observations, levels of ERG9 transcripts were observed to 
5 decrease in four treatments in the current study. This does not meet the criterion of 

"transcript level altered in five treatments" used to define responsive genes in the current 
study. Another gene in the pathway, ERG1 , was not observed to respond to treatments 
used here. ERG1 activity appears to be regulated by protein localization (Leber et aL, 
1998, . Mol. Biol. Cell 9, 375-386) however it has also been reported to respond to 

10 oxygen and sterol-limitation (M'baya et aL, 1989, Lipids 24 1020-1023). In a study using 
promoter fusions as a read-out of transcriptional changes, Dimster-Denk et aL (1999, J. 
Lipid Res. 40:850-860) observed the greatest response to fluconazole treatment from 
ERG2, ERG8, ERG9, ERG12 and ERG19. With the exception of ERG2, these are found 
in the isoprenoid pathway rather than the ergosterol pathway, and were not observed to 

1 5 respond to treatment used in the current study. 

Interestingly, the ergosterol genes were highly responsive to genetic disruption of 
ERG2, ERGS, and ERG6. Seven of the ten responsive ergosterol genes were increased in 
all three mutant strains (Figure 5). Transcripts from the three remaining genes, ERG2, 
ERGS, and ERG6 were increased in two of three mutant strains. Not surprisingly, each 

20 was significantly decreased in the strain bearing a knockout of that gene. Responsive 
genes in other pathways were also altered in all three deletion strains, but this was more 
sporadic than the uniform response of genes in the ergosterol biosynthetic pathway. 

Example 4: Other Responsive Genes Related to Lipid, Fatty Acid and Sterol 
25 Biosynthesis 

In addition to participating in cell membranes, esterified ergosterol is found in lipid 
particles, which may serve as storage reservoirs or as intermediates in intracellular 
transport (Yang et aL, 1996, Science. 272:1353-1356; Zinser et aL, supra). ARE1 is one 
of the two genes responsible for the esterification of ergosterol, which is the final step in 
30 the pathway leading to ergosteiyl esters which are accumulated for storage and 
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maintenance of sterol homeostasis (Yang et a/., supra). CYB5 (cytochrome B5) was 
identified by an ability to overcome ketoconazole hypersensitivity when overexpressed in 
an ergl 1 background (Truan et aL, supra). Presently little else is reported about CYB5. 
ACH1 (acetyl CoA hydrolase) is specific for acyl CoA molecules formed during 

5 the de novo synthesis of saturated fatty acids and may be responding to an overproduction 
of sterol intermediates formed during inhibition of the pathway. FAS1 (fatty-acyl-CoA 
synthase) deletion strains have been shown to posses reduced levels of ergosterol esters 
and sphingolipids indicating a possible role in lipid biosynthesis and metabolism (Daum et 
aL, supra). LCB1 (serine C-palmitoyltransferase) is the first enzyme involved with the 

10 biosynthesis of the long chain base component of sphingolipids. Increase in transcripts of 
this transferase suggests a potential interaction between the ergosterol and sphingolipid 
biosynthetic pathways in yeast. 

Transcript levels of several other genes involving lipid, fatty acid, and sterol 
metabolism were decreased. The SUR2 product hydroxylates the sphingoid C-4 of 

15 ceramide (Haak et aL, 1997, J. Biol. Chem. 272, 29704-29710), and the decrease observed 
in this transcript may demonstrate another correlation between sphingolipid and sterol 
pathways. ELOl encodes an enzyme responsible for elongation of fatty acids, and decline 
of this transcript level under conditions of this study may indicate a compensatory 
response in cellular fatty acid content to limited sterols following perturbation of the 

20 ergosterol pathway. OLE I, encoding delta-9 desaturase, is needed for formation of 

unsaurated fatty acids and also shows decreased transcript level here. OLE1 is repressed 
by the presence of saturated fatty acids, and thus the decline in OLE1 transcript may 
indicate an increase in saturated fatty acids, possibly another compensatory response to 
altered ergosterol. Interestingly this fatty-acid responsive repression of OLE1 is mediated 

25 through the FAA1 and FAA4 products (Cobon & Haslam, supra) 9 and the level of FAA4 
transcripts decreased under the conditions used in this study. The connection of fatty acids 
to perturbation of ergosterol has not be demonstrated by other studies and may indicate a 
re-structuring of cell membrane in response to reduced ergosterol. 
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Example 5: Genes with Products Related to Cell Envelope Structure and Function 

While ergosterol is found throughout the cell membranes, it is most abundant in 
the plasma membrane and secretory vesicles and is reported to play an important role in 

5 mitochondrial respiration (Daum et aL y supra; Parks, supra; Zinser et al. 7 supra). 

Depletion of ergosterol with concomitant accumulation of sterol intermediates can result 
in the alterations of membrane functions, synthesis and activity of membrane bound 
enzymes, mitochondrial activities, and uncoordinated behavior of the yeast cell (Parks, 
supra; Vanden Bossche, 1 985, Curr. Top. Med. MycoL 1:313-351). Genes encoding cell 

10 envelope proteins or products required to modify envelope proteins are included in the 
responsive subset, as shown in Table 2. Changes in transcript level of these genes may 
thus represent a response to ergosterol perturbation by treatments used here. 

Table 2: Genes Encoding Products Associated with Cell Envelope Structure or 

15 Function 

Genes involved in protein glycosylation: 

ALG5 X \ ALG3, ALG9, ALG6 3 *, ALG8 l \ ALG10 
transferase (PMT family; TMT2 32 ) 
CWH41,GLS2 1 ' 5 , MNS1 

20 Genes encoding O-glycosylated proteins: 

AGAU AGA2, BAR1, CTR1, CTS1 2 *, CWP1, CWP2, DAN1 FET3, FLOl, FLOS, 
FL09, FLO10, FUS1, GAS1, GIT1, HKR1, HSP150, KEX2, KNH1, KRE1 3 ' 4 , 
KRE9*, MTO2, MSB2, PEX15, PGM1, PGM2, PIR1, PIR3, PRB1 1 4 , SAG1, 
SEC20, SED1, SED4, SLG1, SRCM, SSR1, STA1, STA2, IW1 V % TlRl* 3 , TIR2, 

25 YLR110C 

Responsive membrane proteins (only responsive genes are shown) 

BAP?*, BAP3 0 6 , D/PJ 3 4 , FET4 X \ HNM1 2 \ HXT1 0 ' 5 , HXT3 2 4 , PH087 0 5 , 
RCS1 1 \ SMF1 3 ' 4 , YDR373W** 4 , GR138C 05 , KL146W 1 4 , YNL065J^\ 
YNL321W 2 - 3 , YOR161&*, YOR271C* 4 

30 GPI anchored proteins: 
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AGal, AGA1, CWP1, CWP2, EGT2 1,4 , FLOl, FLOS, FL09, GAS1, ICWP, 

KRE1 3 ' 4 , PRY3, SED1, TIP1 2S , TIR1 33 , TIR2, YAP3, YCR089W, YDR055W 0 5 , 

YDR134C, YDR534C, YEL040W, YER150W 05 , YGR189C, YJR151C, YLR1 10C, 

YNL300W 3 ' 7 , YOR009W, YOR214C 
5 Genes involved in synthesis of f}-l,6-glucan: 

CWH53, FKS2, KNR4, HKR1, KRE1 3,4 , KRE5, KRE6, KRE9 M t KRE11. SKN1 
Genes related to osmotic stress: 

Part of osmotic stress response signal pathway: HOGl^ 3 

Transcription factor with a role in salt tolerance: CIN5 1 ' 6 
10 Induced by osmotic stress: DDR48 3,4 , DDR2, GRE1 , GRE2 3 * 4 , GRE3, HOT1 , PTP2, 

PTP3 SIP18, SKN7, SKK2, SSK1, STE11, SLN1, SHOl, YPD1 
Responsive proteins involved in secretion (only responsive genes are shown): 

VID24 0 7 , RET2 34 , SEClf 5 , COPl 2 ^, LSTf? A , NHX1 05 , YGL054C 06 , 

YHR138C 0,5 

15 Table 2 lists genes encoding products associated with cell envelope structure or 

function. Genes given in bold type were responsive in the study; italicized bold type 
indicates genes with decreased transcript level Superscripts indicate the number of 
treatments to which the gene responded. The first number in the superscript indicates the 
number of the three genetic perturbations that elicited a response, and the second number 

20 indicates the number of the eight drug treatments that elicited a response. Lists of genes 
were obtained from YPD database (Hodges et ai, 1999) and (Daum et aL, supra; Haak et 
aU supra\ Hamada et aL, 1998, Mol. Gen. Genet 258:53-59; Hamada et aL, 1999, J. 
Bacteriol. 181:3886-3889; Inoue et aL, 1995, Eur. J. Biochem. 23:845-854). 

Transcript levels of several genes encoding proteins with GPI anchors on the 

25 mature product (Hamada et aL, 1 998, supra) are also increased in this study (Table 2). 

This observation may suggest a compensatory response to alterations in plasma membrane 
composition following experimental treatments. Alternatively this response may indicate 
cross-talk between paths of inositol utilization, since inositol is incorporated into both GPI 
anchors and sphingolipids, and alterations in LCB1 seen here may indicate a regulation of 

30 sphingolipid biosynthesis under the study conditions. 
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A decrease in ergosterol may affect genes associated with the formation and 
maintenance of cell wall. Two of these, GSC2, encoding a component of |M 3-glucan 
synthase (Inoue et aL, supra), and KRE1 , encoding a protein required for £-1 .6-glucan 
assembly (Boone etal, 1990, J. Cell Biol.l 10:1 833-1 843) may be responding to 
5 alterations in cell wall function resulting from a decline in the ergosterol component of the 
plasma membrane. TIR1 , a stress-induced cell wall structural protein, was found to be 
highly upregulated in all of the deletion mutants and by several drug treatments. TIR1 is a 
glycoprotein residing in the ceil wall that may act to protect the membrane from stress 
(Kowalski et a/., 1995, MoL Microbiol. 15:341-353). Recently, a link between protein N- 

10 glycosylation and the biosynthesis of 0-1, 6-glucan has been described showing incomplete 
N-chain processing leads to loss of {J-l, 6-glucan (Shahinian et a/., 1998, Genetics. 
149:843-856). Transcripts of several genes encoding products needed for synthesis of N- 
linked oligosaccharide unit as well as the PMT2 transferase are increased following 
ergosterol perturbation. These are listed in Table 2, as are a number of responsive genes 

15 encoding proteins predicted to be O-glycosylated in the mature form (Hodges et aL, 
supra). It is possible that transcript levels of components of the cell envelope and 
enzymes responsible for modifying them may be altered in response to alterations in the 
plasma membrane resulting from perturbation of ergosterol biosynthesis. 

20 Example 6: Responsive Mitochondrial Genes 

Unlike higher eukaryotes, 5. cerevisiae has higher levels of ergosterol in the 
mitochondrial inner membrane than other mitochondrial membranes (Tuller & Daum, 
1995, FEBS Letters 372:29-32; Zinser et al. y supra). Previous work has shown that 
changes in sterol composition of membranes can affect membrane function (Cobon & 

25 Haslam, supra; Parks, supra), and it could be predicted that perturbations of ergosterol 
levels within the cell may affect the function of mitochondrial enzymes. This pattern does 
indeed emerge with the increase in transcript level of several components of the 
mitochondrial electron transport system, as shown in Table 3. 

30 Table 3: Genes Encoding Mitochondrial Proteins 
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Electron transport complexes, inner mitochondrial membrane: 



Ubiquinol cytochrome c reductase complex III 



COB, CYT1 24 , COR1 2 - 3 , QCR2, QCR6 1,4 , QCR7, 8, 9, 10, RIP1 
Cytochrome c (anaerobic isoform) 



1.5 



5 



CYC7 1 * 5 
Cytochrome c oxidase 



COX1, COX2, COX3, COX4 3 4 , COX5A 2,4 / COX5B l \ 



COX6, COX7, COX8 05 , COX9, COX12, COX13 3 4 
ATP synthase 



10 



ATP1 3 - 5 , ATP2, ATP3, ATP4, ATP5, ATP6, ATP7, ATP8, ATP9, 



ATP14 0 ' 6 , ATP15 3 ' 4 , ATP16 0 - 5 , ATP17, ATP20, INH1 0 ' 6 , TIM1 1 
ADP / ATP carrier protein 

AAC1, AAC3 l \ PET9 3 - 3 , YPL134C 1,4 , YPR021C 
Elements of the TCA cycle: 
15 CIT1, ACOl 2 \ IDH1, IDH2, KGD1 1,5 , KGD2, LPD1, LSC1, LSC2, 

SDH1 2 \ SDH2, SDH3, SDH4, FUM1 2 ' 4 , MDH1 
Other responding mitochondrial proteins: 

ARG5,6° 5 , ARG7 1 6 , COT1 3 5 , GPD2?\ IDP1 0 ' 5 , ILV6 3 5 , MRP1 0 6 , 

PFK27 0 > 6 , SHMl 1 ' 4 , SOM1 0 6 , SOD2 2 \ TIM23 33 , YAH1 2 \ YIL154C 06 
20 Table 3 lists genes encoding mitochondrial proteins. Genes given in bold type were 

responsive in the study; italicized bold type indicates genes with decreased transcript level. 
Superscripts indicate the number of treatments to which the gene responded. The first 
number in the superscript indicates the number of the three genetic perturbations that 
elicited a response, and the second number indicates the number of the eight drug 
25 treatments that elicited a response. Lists of genes were obtained from the ExPASy / 
Boehringer Mannheim metabolic pathways (http:/Avww.expasy.hcuge.ch/cg!«bin/search- 
biochem-index) and from YPD (Hodges et aL 9 supra). 

Transcript levels of four members of the cytochrome c oxidase complex, COX4, 
COX5A, COX8, and COX 13 were also increased. Cytochrome oxidase resides in the 
30 mitochondrial inner membrane and catalyzes the transfer of electrons from reduced 
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cytochrome c to molecular oxygen. Similarly, transcript levels were increased for four 
members of cytochrome c reductase, RIP1, CYT1, QCR6, and COR1, a component of 
mitochondrial respiration which catalyzes the transfer of electrons to cytochrome c with 
concomitant transfer of protons across the inner mitochondrial membrane. Another 
5 critical respiratory enzyme residing in the mitochondrial inner membrane is ATP synthase. 
Five genes encoding subunits of the enzyme were observed to be upregulated (ATP1, 
ATP14, ATP 15, ATP16, and INH1). Other genes involved with energy generation whose 
transcripts were increased are listed in Table 3, as are a number of other responsive genes 
encoding mitochondrial proteins, including YAH1, which contains a cytochrome c heme- 

10 binding signature. (YPD; Hodges et al,, supra). 

Perturbation of mitochondrial electron transport could arise from a decrease in 
ergosterol in the inner membrane due to interference of the treatments with ergosterol 
biosynthesis, or from a direct interaction between the chemical agents used and the 
mitochondrial enzyme complexes. For example, the cytochrome c complex has been 

15 analyzed with the use of metal-chelating drugs, which block reduction at the ubiquitinol 
oxidation site (Boumans et al 9 1997, J. Biol. Chem. 272: 1 6753-1 6760). Since more than 
half of the responsive mitochondrial genes are found to respond in both genetic alterations 
and in response to drug treatment, it is likely that the effect is mediated through ergosterol 
biosynthesis and does not arise as a direct consequence of drug action. 

20 

Example 7: Response to Oxygen Stress 

Transcripts from four of the five members of the hypoxic gene family ( ANB 1 , 
COX5b, CYC7, and HEM13) are reduced in response to treatments used in this study 
(Table 4). Two of these genes, CYC7 and COX5b, are responsible for encoding the 

25 hypoxic isoforms of cytochrome c and cytochrome c oxidase. Their expression level has 
been shown to depend on the level of available oxygen (Burke et a/., 1997, J. Biol. Chem. 
272:14705-14712). In addition, expression of HEM13 is repressed by oxygen and heme 
(Keng, 1992, Mol. Cell Biol 12:2616-2623). Thus the decrease in transcript level seen for 
these anaerobic-induced genes may be in response to increased levels of intracellular 

30 oxygen. Another piece of evidence to support this hypothesis is the observation that 
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transcripts from four genes involved with oxidative stress response (AHP1, GRE2 
YDR453C, and SOD2) are also increased (Table 4). 

Table 4: Genes Responding to Oxygen Stress 
5 Genes induced by hypoxia: AAC1, ANB1, COX5B l \ CPR1 , CYC7 U \ 

ERG11 3,3 , HEM13 Qfl 9 HMG2, OLEl 2 \ ROX1, 

SUT1 

Genes induced by anarobiasis: CYB2, ERG11 3 \ TIPl 2 * 5 , TIR2, TIR1 3J , 

10 

Major oxidant scavenging enzymes: CCP1, CTTl, TSA1 , SOD1, SOD2 2 4 , TRR1 , TRX1 , 
RX2, 

GLR1, YDR453C 2,4 , YCL035, AHP1, GRE2 3 ' 4 

15 Pentose phosphate pathway shunt: ZWF1 3,4 

Table 4 lists genes encoding proteins responsive to oxygen stress. Genes given in 
bold type were responsive in the study; italicized bold type indicates genes with decreased 
transcript level. Superscripts indicate the number of treatments to which the gene 

20 responded. The first number in the superscript indicates the number of the three genetic 
perturbations that elicited a response, and the second number indicates the number of the 
eight drug treatments that elicited a response. Lists of genes were obtained from YPD 
(Hodges et al 9 supra) and from (Jamieson, 1998, Yeast 14, 1511-15; Slekar et al y 1996, J. 
Biol. Chem. 271:28831-28836; Zitomer et a/., 1997, Kidney International 51:507-513). 

25 Recently, Kwast et al (1999, Proc. Natl. AcacL Sci. USA. 96:5446-5451), have 

determined that OLE1, CYC7, and COX5b belong to a separate class of hypoxia- 
responsive genes whose regulation is sensitive to carbon monoxide. These authors have 
concluded that cytochrome c oxidase may act as an oxygen sensor for the induction of 
hypoxic genes. In the course of normal catalysis, cytochrome c oxidase transfers two 

30 electrons to molecular oxygen; incomplete reactions can result in the formation of reactive 
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oxygen species. Machida et al. (1998, J. Bact 180:4460-4465) have shown that exposure 
to famesol results in the generation of reactive oxygen species through an indirect effect 
on mitochondria] electron transport. Famesol can be generated from farnesyl 
pyrophosphate, a precursor to sterols. It is possible that limitation of ergosterol could also 
5 impact the activity of cytochrome c oxidase or other components of the electron transport 
chain, leading to the generation of reactive oxygen species. 

Example 8: Heme Regulation 

Transcripts of HEM 1, the first gene involved in the biosynthesis of heme, 

10 increased in response to eight of the eleven treatments tested. Interestingly, all eight were 
drug treatments, suggesting a compensatory response to drug treatment not represented in 
the mutant strains. It is possible that this results from heme depletion by the drugs, or 
from an unanticipated secondary effect of drug exposure. Transcript levels of HEM13, the 
rate-limiting step of heme biosynthesis (Zitomer & Lowry, 1992, Microbiol. Rev. 56, 1- 

15 11) were reduced in response to seven drug treatments and again, in none of the 

mutational perturbations. HEM 13 has been shown to be repressed by heme and oxygen 
(Amillet et al 9 1996, J. Biol. Chem. 271:24425-24432; Keng, supra), suggesting that these 
may be limiting under the conditions of the drug treatments. 

Heme plays a central role in sterol synthesis and has been shown to 

20 transcriptionally regulate several genes involved with the process (Parks & Casey, 1995, 
Ann. Rev. Microbiol. 49:95-1 16; Thorsness et al, supra). The accumulation of 5- 
aminolevulenic acid, the product of Hemlp, derepresses 3-hydroxy-3-methyl-glutaryI 
CoA reductase, leading to increased levels of 2,3-oxidosqualene (Lorenz et al. y supra). 
Heme is also required for the enzymatic activity of Erg3p (C-5 sterol desaturase), Erg5p 

25 (C-22,23 desaturase) (Parks, supra). Ergl Ip contains heme, and shows heme-regulated 
expression (Turi & Loper, supra). 

Heme also plays a role in sensing intracellular oxygen levels (Zitomer & Lowry, 
supra). Genes regulated by heme are listed in Figure 6. In the present study, transcripts 
from four heme-induced genes are increased and transcripts from three heme-repressed 

30 genes are decreased, consistent with induction of heme regulation of expression. 
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Interestingly, ERG1 1, another heme-repressed gene, is increased, however this is likely 
due to additional regulation of this gene by ergosterol levels, as discussed above. 

Because transcripts of HEM 1 and HEM 13 in the heme pathway respond only in 
cells treated with chemical agents, it is possible that levels of heme are directly affected by 

5 chemical interaction with the agents used. This prediction would be consistent with 

alterations in expression of heme-containing proteins. Heme containing proteins identified 
by the YPD data base (Hodges et aL 9 supra) are listed in Figure 6. Transcripts of eight 
genes encoding heme-containing proteins are responsive to the conditions used, although 
two are decreased and six are increased. Inspection of the responsive heme-containing 

10 proteins reveals that they are members of groups regulated by other factors, including 

oxygen level and heme level. Thus their behavior does not shed additional light on effects 
of azoles and other agents used here on other heme-containing molecules in the cell. 

Example 9: Correlation of the Expression Pattern of a Novel Azole to Other 
15 Conditions 

Included in this analysis was the expression profile in response to treatment with a 
compound containing an azole moiety identified in a whole cell screen of S. cerevisiae. 
PNU- 144248 contains an imidazole ring, yet is structurally distinct from the other azoles 
tested. The rationale for including it in this study is to determine if exposure to this 

20 unknown compound results in a pattern of expression similar to that seen in response to 
treatment with azoles of known biochemical function, thereby suggesting a similar mode 
of action. Such information is useful to assess the predictive ability of expression profiles 
observed in response to an agent with an unknown mode of action. Of the 156 treatment- 
affected genes with increased transcript levels following ergosterol perturbation, 144, or 

25 92%, were increased also in response to treatment with PNU- 144248. All of the 

ergosterol, lipid, fatty-acid and sterol metabolism genes and 17 of the 19 genes involved 
with energy generation were included (the exceptions were QCR6 and COX13). 29 of the 
34 of the unknowns were also included. From the set of 78 treatment-affected genes with 
decreasing transcript levels following treatment, 40, or 51%, also decreased following 

30 treatment with PNU-144248. These data suggest PNU-144248 does indeed behave as an 
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azole as measured by cellular responses at the level of gene transcription. 

It is interesting that a greater proportion of genes with increased transcript level 
were in common between PNU-144248E and other treatments known to perturb ergosterol 
biosynthesis than genes with decreased transcript level. While the data are not sufficient 

5 to determine the significance of this difference, it is consistent with the overall patterns 
seen in Figures 5 and 6 and Tables 2-4, which contain a preponderance of genes with 
transcript level increased in response to the treatments used here and relatively few genes 
with decreased transcript level. A number of factors may contribute to this. Genes 
described in the tables and figures were selected because they have been characterized in 

10 other studies, and it is possible that positive changes in transcript level were more likely to 
be detected in these studies than a decrease in level might have been. There are also more 
responsive genes included in the class with increased transcripts, so it is perhaps not 
surprising that numerically more were included in the figures. Finally, it is more 
straightforward to measure an increase in a biological product if the level is changing due 

15 to new synthesis, as would be expected if the level were increasing. The turnover of 

particular transcripts is not well characterized, and differences in transcript levels observed 
may reflect stability and turnover rate in addition to nascent synthesis, especially in 
transcripts seen to decline following treatment 

20 Example 10: Genes of Unknown Function 

Additional genes responsive to perturbation of the ergosterol pathway have been 
identified by the methods of this invention. Of the 1 56 genes found to be upregulated by 
the criteria used herein, 34 are of unknown function and little is known about the majority 
of these genes. Interestingly one of these, COS8, is increased in ten of the eleven 

25 conditions tested and another, AFG1 , is increased in nine conditions. The only other gene 
responding in more than eight conditions was YNL300W, encoding a cell wall component 
with putative GPI anchor (Hamada et a/., 1999, supra). These genes, in conjunction with 
several other highly responsive unknown genes, represent a novel class of genes that may 
play a role in the cellular response to a decline in ergosterol or perturbation of biosynthetic 

30 machinery. These highly responsive unknowns may be indicative of a key compensatory 
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response that, if targeted, could be a point of chemical intervention that would be 
deleterious to the fungal cell. 

The following genes, for which some biological information is available, have 
increased transcript levels: 
5 CLN3, CPA1 , DAK1 , DOT6, DSK2, EN02, HOM3, HSP1 04, LYS20, MET1 7, 

MET6, NTH1, NUD1, PAT1, PDI1, PRY1, PYC2, RPC19, RP021, RPS18B, RPS26A, 
RRP42, SAH1, SAM2, SCC3, SCH9, SPE1, SRA1, SSA1, STP4, TUP1, YLR109W, 
YMR173W, YMR304C, YNL065W, YPR184W 

The following genes of unknown function show increased transcript levels: 
10 AFG1, COS8, HMOl , PTC2, YAR068W, YBL081 W, YBR269C, YCL042W, 

YDL012C, YDL039C, YDL204W, YDR157W, YDR247W, YDR433W, YDR476C, 
YEL007W, YER035W, YFR017C, YGR165W, YGR294W, YHB1, YHR039C, 
YLR100W, YLR251W, YLR356W, YML030W, YMR310C, YNL134C, YNR046W, 
YOL027C, YOR248W, YOR267C, YOR359W, YPL014W, YPL017C, YPL067C, 
15 YPL247C, YPR144C, YPR157W 

The following genes, for which some biological information is available, show a 
decrease in transcript level: 

ADE1, APT1, CMK2, DED1, ECM1, GAR1, HIS5, HSP26, HSP42, HTA2, 
MRT4, OYE2, PHOl 1, PH05, PKH1, RAS2, RKI1, RNR4, RPL39, RRPS25, SML1, 
20 STU1, TIF51B, URA4, YAR073W, YBL005W-b, YBR012W-a, YBR204C, YDR516C, 
YLL025W 

The following genes of unknown function respond with decreased transcript levels: 
YAL008W, YAR003 W, YBR113W, YCL036W, YDR214W, YDR492W, YEL033 W, 
YER067W, YER138C, YER160C, YGL020C, YGR243W, YHL021C, YJL012C, 
25 YJL055W,YmO27W,YLR065C,YMLO39W,YML04OW,YMRO51C,YMR316W, 
YNL087W, YOL002C, YOL014W, YOR175C, YPL019C 

Example 11: Changes in Transcript Level Over Time 

It is likely that responses occurring either earlier or later than the time of exposure 
30 chosen for microarray analysis were not detected. Quantitative polymerase chain reaction 
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(PCR) reactions were used to measure transcript levels from a few genes in the ergosterol 
pathway. This was done to define the time frame of the cellular response to perturbation 
of ergosterol by chemical agents and to confirm changes in transcript levels from 
responsive genes identified by microarray hybridization. 

5 Cultures were grown in the presence of two azoles, ketoconazole and voriconazole, 

and aliquots harvested at 1 , 3, 5, and 24 hrs of exposure. Total RNA was extracted as 
described above and used as a substrate for quantitative PCR, using probe/primer sets to 
detect ERG3, ERG4, ERGS, ERG6, ERG1 1 and ERG24 transcripts. PCR reactions were 
performed using the Taqman Gold PCR kit and analyzed using a Prism 7700 (PE Applied 

10 Biosy stems). 

In this assay, the number of reaction cycles required to raise the level of product 
above a threshold (Q) is a measure of the abundance of the initial template. Threshold 
values are calculated from 10 standard deviations of background. To control for minor 
variance in loading conditions, each PCR reaction also included an internal control 

1 5 reaction, to translation elongation factor TEF 1 , which has relatively constant expression 
levels that are not predicted to vary in response to perturbation of ergosterol (data not 
shown). Data reported for each reaction are given as the difference between experimental 
C t and the C t of the standard gene, TEF1 . The difference between experimental C t and 
control Ct is a measure of the change in abundance of experimental transcript in each 

20 reaction. These values are plotted in Figure 7. An increase in cycle number indicates a 
decrease in transcript abundance. Examination of Figure 7 reveals that the response of 
cells to all four exposures was very similar for the five transcripts measured. 

Probe-primer sets for the ERG3, ERG4, ERG5, ERG6, ERG1 1 and ERG24 were 
identified using Primer Express (PE Applied Biosystems, Foster City, CA). Sequences of 

25 primers and fluorescently labeled probes were: 

5'-CCAAATGGCAATGGAAATCAA (SEQ ID NO:l), 5'- 
ACAAACCATGGAACGGTCAAC (SEQ ID NO:2), 6FAM- 
TTGGCAGTCAGTGCTATCCCATGGATG-TAMRA (SEQ ID NO:3), ERG3; 
5*-TCACTTTGGGTGCCTGTTTG (SEQ ID NO:4), 5'- 

30 ACATGACAACCCCCAATTGTG (SEQ ID NO:5), 6FAM- 



06/24/2004, EAST Version: 1.4.1 



WO 01/21839 



PCTAJS00/26O5S 



34 



TGTCACATAGCCGTAAGTCTCCCACTGCT-TAMRA (SEQ ID NO:6), ERG4^'- 
CGTGCATCAACTTACACCAAGC (SEQ IDNO:7), 5'- 
GTGCTCAAATGGCTAAGGATCAT (SEQ ID N0:8), 6FAM- 
TAACACCAAACTGGCTTACCACCTGCAGCAA-TAMRA (SEQ IDNO:9), ERGS; 
S'-AGACACCTACAGAGGAGGAATCACA (SEQ ID NO:10), 5'- 
GACCACAGATCTCTCAACTACTTCAGA (SEQ ID N0:1 1), 6AM- 
TGCAACCCCAACTCCCAAACCTTTAAGT-TAMRA (SEQ ID NO: 12), ERG6; 
5'-CCACCTCTAGTGTTTTACTGGATTCC (SEQ ID N0:13), 5'- 
CTTCGAAAAACTCGTATGGCTTC (SEQ ID NO: 14), 

6FAM-TACCGTACACAACAGCACTACCGACCCA-TAMRA (SEQ ID NO: 15), 
ERG11; 

5'-ACCACAGGCAATAGACAGTCCATA (SEQ ID NO:16), 5'- 
TG AGCTTTGG AA C GGTATC AAG (SEQ ID NO:17), 

6FAM-TTCACGATTGCCCAGATAGTGTAGCGCA -TAMRA (SEQ IDNO:18), 
ERG24. 

Primers and probes used for TEF1 were: 
5'-GTAGAGTTGAAACCGGTGTCATCA (SEQ ID NO:19), 5'- 
AACGGACTTGACTTCAGTGGTAACA (SEQ ID NO20), 
VIC-CAGGTATGGTTGTTACTTTTGCCCCAGCTG-TAMRA (SEQ ID NO:21). 

Two different probe/primer sets were used to measure ERG1 1 transcripts. These 
are labeled ERG1 1 and ERG1 6 (reflecting alternative names for the same gene) in Figure 
7. The behavior of these two lines in each plot reveals the qualitative nature of this 
experiment, while the ERG1 1 and ERG1 6 response curves were parallel in each 
experiment they differ in magnitude. This most likely reflects differences in the 
efficiencies of the two probe/primer sets. Because of this, the shape of each curve was 
considered more significant than its absolute magnitude. 

Transcripts from ERG3 increase throughout the time course. Other transcripts 
show a transient decrease (as reflected by the upward deflection of the curve, reflecting 
higher cycle number to reach threshold level relative to the internal control). This is 
followed by a recovery, and possible increase in level of transcript over the period 



06/24/2004, EAST Version: 1.4.1 



WO 01/21839 



PCT/US00/26055 



35 



between 5 to 24 hours. Based on this, the microarray samples, taken at 90 minutes of 
exposure, are likely to capture much of the cellular response, at least as reflected by the 
behavior of the set of five genes measured here. 

5 Example 12: Description of a database relating functional annotation to expression 
profile 

Microairay experiments can reveal the transcriptional profile of a cell under a 
given condition. To interpret this profile it is helpful to analyze the pattern of responsive 
genes in the context of the biological pathways altered in response to treatment. Towards 

] 0 this goal a relational database was constructed to connect gene annotations with 

designations on the array. Gene designations from the array were indexed to standard 
names for each gene and the biological role designation for each gene, indexed by 
standard name, was obtained from a licensed database of the yeast literature, YPD. Two 
tables were created in the database: (1) a table relating a biological role to each gene on 

15 the Asymetrix yeast microarray and (2) a table relating a chip feature to each annotated 
gene. The microarray manufacturer provided the first table and the second was 
constructed by eliminating unannotated genes and genes designated to be of unknown 
function, then making multiple entries for genes fulfilling multiple biological roles, and, 
finally, combining related biological roles into groups relating to a particular pathway. 

20 These tables allow one to determine which biological pathways are most responsive to a 
particular treatment 

The YPD database provides a service that allows relational databases to connect 
transcript profiles with functional annotations. The YPD database provides this service. 
Additionally, descriptions of such databases have been published by several labs. Recent 

25 examples include: 

Systematic management and analysis of yeast gene expression data. 
Aach J, Rindone W, Church GM; Genome Research 10, 431-45, 2000 
Department of Genetics, Harvard Medical School, Boston, Massachusetts 021 15 USA, 
and the Lipper Center for Computational Functional Genomics, Boston, Massachusetts 
30 02115. 
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Mining and visualizing large anticancer drug discovery databases. 

Shi LM, Fan Y, Lee JK, Waltham M, Andrews DT, Scherf U, Paull KD, Weinstein JN; 

J. Chem. Inf. Conput. Sci. 40, 367-79, 2000 

Laboratory of Molecular Pharmacology, Division of Basic Sciences, Information 
5 Technology Branch, National Cancer Institute, Bethesda Maryland 20892. 

Example 13: Pathway Annotation 

S. cerevisiae cells were exposed to a variety of treatments and responsive genes 
identified using the method described above for perturbations of ergosterol biosynthesis. 

10 Responsive genes for each treatment were flagged in the database. The table of flagged 
genes was related to the annotated genes in the database and a histogram developed of 
genes in each pathway responsive to each treatment. An example of such a histogram is 
provided in Figure 8. This method has the advantage that the same subset of genes is 
queried for each treatment, and the magnitude of response from each pathway can be 

15 compared without knowledge of the total number of potentially responsive genes in each 
pathway. Such a histogram allows similarity among treatments to be discerned and the 
formation of hypotheses about the pathway or pathways responsive to a particular 
treatment. 

20 Example 14: Mode of Action Prediction 

This method was applied to members of the indazole class of antifungal agents to 
predict the mode of action of this class of compounds. Cells were exposed to half-MIC 
concentrations of four analogs of 3-phenylindazole previously identified by Phil Zaworski 
and Marek Nagiec and synthesized by Fred Ciske and Michael Genin. Cells were grown 

25 in rich medium at 30 degrees with shaking at 225 rpm. Cells were harvested following 
either 1 or 3 hours of exposure, RNA extracted and labeled as described above, and 
hybridized to Affymetrix yeast microarrays. 

In this example data were normalized to transcript profiles of untreated cells 
harvested at either 1 or 3 hours. The intensity measure for each gene was normalized for 

30 chip variation by division with the geometric mean of the distribution of expressed genes 
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on the particular chip, as described above for the ergosterol example. Genes whose 
transcript level increased in response to each of the four treatments at either or both time 
points were considered to be indicators for that treatment class. This set of responsive 
genes consisted of 350 genes. 

5 The set of 350 genes with responsive transcripts contained 68 transcripts whose 

product is related to cell envelope integrity and biosynthesis, 65 transcripts related to bud 
formation, vesicular transport and DNA biosynthesis/checkpoint control, 79 transcripts 
related to osmotic and chemical stress. Taken together, this profile is consistent with loss 
of envelope integrity-resulting from perturbation of microfilament transport or bud 

10 formation. This pattern is consistent with other tests for biological activity of this class of 
antifungals performed here (data not shown). 

In addition to annotated genes, the set of responsive transcripts contained 92 genes 
which were unannotated at the time of this analysis. These are candidates for additional 
components of the indazole-response. Further, this set may contain the molecular target or 

1 5 additional biologically-related targets of indazoles. The set of unannotated indazole- 
responsive genes is given below: 



YAR002W 


YBR007C 


YCL047C 


YCL049C 


YCR061W 


YCR062W 


YCR076C 


YCR095C 


YDL099W 


YDL173W 


YDL177C 


YDL241W 


YDR163W 


YDR383C 


YDR391C 


YDR407C 


YDR428C 


YDR466W 


YDR482C 


YDR504C 


YDR533C 


YER030W 


YER037W 


YER128W 


YFR003C 


YGL072C 


YGL108C 


YGL117W 


YGL174W 


YGR016W 


YGR035C 


YGR042W 


YGR131W 


YGR189C 


YGR243W 


YHR059W 


YIL024C 


YIL117C 


YJL021C 


YJL037W 


YJR054W 


YJR079W 


YJR088C 


YKL052C 


YKLQ54C 


YKL076C 


YKR011C 


YKR089C 


YLR0I1W 


YLR099C 


YLR100W 


YLR194C 


YLR198C 


YLR346C 


YLR414C 


YML041C 


YML053C 


YML131W 


YMR010W 


YMR031C 


YMR098C 


YMR103C 


YMR130W 


YMR136W 


YMR184W 


YMR245W 


YNL133C 


YNL149C 


YNL212W 


YNL258C 
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YNR025C 
YOL101C 
YOR152C 
YPL070W 
YPR105C 



YNR067C 
YOL150C 
YOR189W YPL034W 



YPL071C 
YPR158W 



YOL014W YOL027C 
YOR013W YOR049C 



YPL107W 



YPL052W 
YPL156C 



YOL034W 
YOR051C 
YPL067C 
YPL168W 



All references cited herein are incorporated herein in their entirety by reference. 
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We claim: 

1 . A method of preparing an expression profile of genes expressed by a cell 
treated with an agent, comprising the step of plotting the log(ratio) of normalized filtered 
ADI values resulting from a hybridization assay performed on a microarray with mKNA 

5 extracted from the cell against a sum of the normalized filtered ADI values. 

2. The method of claim 1, wherein the agent is an antibiotic agent. 

3. The method of claim 2, wherein the antibiotic agent is an antifungal agent. 

4. The method of claim 1, wherein the agent has an unknown mode of action. 

5. A method of identifying an agent capable of altering expression of a gene 
10 in a cell, comprising the step of: 

identifying a treatment-affected gene from a prepared expression profile, 
wherein the treatment-affected gene's datapoint on the expression profile exceeds one 
standard deviation from a distribution. 

6. The method of claim 4, wherein the datapoint exceeds two standard 
1 5 deviations from the distribution. 

7. A method of identifying a first agent capable of interfering with a 
biosynthetic pathway, comprising the steps of: 

comparing a plurality of first treatment-affected genes identified from an 
expression profile of a first cell treated with the first agent, with a plurality of second 
20 treatment-affected genes identified from an expression profile of a second cell treated with 
a second agent known to interfere with the biosynthetic pathway; and 

determining whether the number of the first treatment-affected genes which 
are identical to the second treatment-affected genes exceeds a percentage of about 20% of 
the total number of second treatment-affected genes. 
25 8. The method of claim 7, wherein the percentage exceeds about 30%. 

9. The method of claim 7, wherein the percentage exceeds about 40%. 

10. The method of claim 7, wherein the percentage exceeds about 50%. 

1 1 . The method of claim 7, wherein the percentage exceeds about 60%. 

12. The method of claim 7, wherein the percentage exceeds about 70%. 
30 13. The method of claim 7, wherein the percentage exceeds about 80%. 
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14. The method of claim 7, wherein the percentage exceeds about 90%. 

1 5. The method of claim 7, wherein the percentage is about 1 00%. 

16. A method of identifying a gene as part of a biosynthetic pathway 
comprising the step of: 

identifying a treatment-affected gene common to a percentage of 
expression profiles prepared from cells treated with agents known to affect the 
biosynthetic pathway. 

17. The method of claim 16, wherein the percentage is at least about 25%. 

18. The method of claim 16, wherein the percentage is at least about 40%. 

1 9. The method of claim 1 6, wherein the percentage is at least about 55%. 

20. The method of claim 16, wherein the percentage is at least about 75%. 

2 1 . The method of claim 1 6, wherein the percentage is at least about 85%. 

22. The method of claim 1 6, wherein the percentage is about 1 00%. 

23. The expression profile prepared by the method of claim 1 . 

24. The expression profile of claim 23, wherein the cell is a yeast cell, and 
wherein the agent is a compound selected from the group consisting of azole compounds 
and imidazole compounds. 

25. An agent identified by the method of any of claims 4 or 6. 

26. A gene, in isolated form, identified by the method of claim 15. 
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SEQUENCE LISTING 

<110> Fostel, Jennifer M. 
Bammert, Gary F. 
Pharmacia Upjohn 

<120> Gene Expression Profiles 

<130> PUJ0290 

<140> 
<141> 

<150> 60/156 r 050 
<151> 1999-09-24 

<150> 60/209,026 
<151> 2000-06-02 

<160> 21 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 1 

ccaaatggca atggaaatca a 



<210> 2 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 2 

acaaaccatg gaacggtcaa c 



<210> 3 
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<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 3 

ttggcagtca gtgctatccc atggatg 2"? 



<210> 4 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 



<210> 5 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 



<210> 6 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 6 

tgtcacatag ccgtaagtct cccactgct 



<400> 4 

tcactttggg tgcctgtttg 



20 



<400> 5 

acatgacaac ccccaattgt g 



21 



<210> 7 



2 
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<2il> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 7 

cgtgcatcaa cttacaccaa gc 22 



<210> 8 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 8 

gtgctcaaat ggctaaggat cat 23 



<210> 9 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 9 

taacaccaaa ctggcttacc acctgcagca a 31 



<210> 10 

<211> 25 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 10 

agacacctac agaggaggaa tcaca 25 



<210> 11 
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<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 11 

gaccacagat ctctcaacta cttcaga 



<210> 12 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 12 

tgcaacccca actcccaaac ctttaagt 



<210> 13 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 13 

ccacctctag tgttttactg gattcc 



<210> 14 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 14 

cttcgaaaaa ctcgtatggc ttc 23 



<210> 15 
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<211> 28 
<2I2> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 15 

taccgtacac aacagcacta ccgaccca 



<210> 16 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 16 

7 A 

accacaggca atagacagtc cata 



<210> 17 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 17 

tgagctttgg aacggtatca ag 22 



<210> IB 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 18 

2 8 

ttcacgattg cccagatagt gtagcgca 



<210> 19 
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<211> 24 
<2I2> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 19 

gtagagttga aaccggtgtc atca 



<210> 20 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 20 

aacggacttg acttcagtgg taaca 25 



<210> 21 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Novel Sequence 
<400> 21 

caggtatggt tgttactttt gccccagctg 30 
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